# Chip Design Sign-Off For Shrinking Technology

### Anurag Sahai

Managing Director, Vertical Gravity, Noida, U.P. India anurags 1999@yahoo.com

Abstract-In today's IC's competitive market place, as technology is shrinking and we are stepping into 45nm and below, IC's complexity is increasing, die areas are also getting larger in order to incorporate the increased functionality that comes with more advanced technology. Due to shrinking design, physical verification of design such as design rule checking (DRC) and layout vs. schematic (LVS), may not give accurate result as expected due to electrical side effects inherent in nanometer process technology. Therefore, shrinking designs demand additional verification before being signed-off for manufacturing. For example, if signal integrity (SI) and other electrical effects are not controlled, a design will likely suffer from lower performance, lower yield, and even functional failure. Any chip failure after manufacturing will result in expensive mask changes and delays in getting the chip to market. Consequently, most designs today require a nanometer sign-off process, where the influence of different electrical effects on the functionality and performance of the design are analyzed before manufacturing.

#### I. COMPLEXITY DUE TO SHRINKING DESIGN

Implementing 45nm and below design raises significant SI (Signal Integrity) related issues that have to be verified in order to avoid chip functional failure. These include reduced feature sizes, decreases in wire pitch, lower power supply voltages, and shrinking threshold voltages. With each new process technology, more and more levels of wire are packed more closely together. As a consequence, the fraction of total wire capacitance represented by lateral coupling increases dramatically. This in turn is responsible for a dramatic increase in on-chip crosstalk noise. Another electrical problem in shrinking designs is increased clock frequencies with faster on chip slew rates. Faster slew rates create more switching noise and increase instantaneous power consumption. This in turn puts stress on the power grid, resulting in voltage (IR) drop and electro-migration. Nanometer sign-off is becoming a challenge, uncovering any problems that may have been missed during design implementation or that have been caused by last-minute manual design changes.

Nanometer sign-off is a final verification to assure that a design does not have electrical issues that will result in chip failure after manufacture. However, finding many problems only just before manufacturing could lead to multiple design iterations and delay in manufacturing and market.

As design is shrinking, we can observe increased power density, increased clock frequencies and slew rate, lower supply voltage, domination of wire to wire coupling capacitance over ground capacitance, smaller transistor thresholds to decrease transistor delay and because of all these factors we can see from figure 1 that crosstalk glitches are increasing and it gives result of silicon respins, longer design cycle, lower performance and lower yield.

## Increased Density and Scaling

- Wire-to-wire coupling capacitance dominates ground capacitance.
- Clock frequencies and slew rates increase.
- Lower supply voltages
- Smaller transistor thresholds







Silicon Respins

Longer Design
Cycles

Lower Performance
Lower Yield

Fig. 1 Crosstalk Trends

#### II. SHRINKING DESIGN SIGN-OFF REQUIREMENTS

The role of nanometer sign-off is to uncover any problems that may have been missed during design implementation or that have been caused by last-minute manual design changes. Nanometer sign-off is a final verification to assure that a



Fig. 2 Flow to fix signal integrity.

design does not have electrical issues that will result in chip failure after manufacture.

45 nanometer and below sign-off is best applied as part of a thorough design closure methodology that issues such as signal and power integrity from early chip planning through to tapeout, as shown in figure 2. The number of problems found during final nanometer sign-off should be small enough to be dealt efficiently, even by means of manual repair.

The tools used during design implementation to support design closure should be fast and flexible in order to maintain design productivity. These tools should guide the design's electrical integrity continuously, without major compromises to design performance, power, or area. The final sign-off phase requires an integrated tool suite that is production-proven and validated against real silicon.

Nanometer sign-off tools used during this phase should be more thorough and accurate than the design closure tools used during implementation. This full-chip examination should include analysis of the subtle and complex interactions that occur between many different electrical effects across many process corners. For example, nanometer sign-off should predict how IR drop affects crosstalk—and how this affects timing, which in turn affects power, which is responsible for IR drop. The key components of nanometer sign-off include accurate 3-D parasitic extraction, full-chip IR drop analysis, full-chip crosstalk analysis, and SI-aware static timing

analysis (STA) that accounts for the influence of both crosstalk and IR drop.

#### III. 3-D PARASITIC EXTRACTION

Extraction accuracy is largely a function of the relationship between the tool vendor and the foundry and also a function of the design representation. Tool vendors need early access to proprietary process information to determine how best to model the process and to capture the necessary process characteristics. Tools need to use the physical details of design elements, rather than simplistic abstractions. As an example, while it is expedient to provide a simplified model of a port, doing so inherently limits design optimization. Similarly, cell models should be instance-specific, not treated as if each cell were isolated.

From a methodology point of view, getting the most out of the silicon requires utilizing the most accurate necessary physical information for design iterations. For early design iterations, turnaround time is important and later in the design cycle, accuracy is most critical issue. Using a less accurate extractor to speed iterations can mean increased margin and increased timing closure risk. An extractor should be fast enough to enable designers to complete block iterations within 1-2 hours. It should also complete a full-chip extraction overnight, using multiprocessor computers if necessary to do so. The output from 3-D extraction tools is either a parasitic database or a text file in detailed specified parasitic format

(DSPF) or standard parasitic extraction format (SPEF). SPEF is required for coupling extraction, which is critical for accurate SI analysis. As design sizes increase, so do the number of parasitic elements required for accurate analysis.

#### IV. IR DROP ANALYSIS

IR drop due to parasitic effects, such as resistance or capacitance, cause a decrease in the current flowing from the VDD (source) input/output pins to the transistors or gates of a design. An excessive IR-drop can increase the delay of targeted paths (global effect) and significantly reduce the voltage reaching a device causing reason for logic failure.

The risk of IR drop increases with the shrinking process technologies of nanometer design. With each new generation of process technology, the current demand per unit area of a design is increased due primarily to shrinking gate-oxide thickness. This, combined with the fact that nanometer designs contain more transistors or gates, adds additional stress to power grid integrity.

The effect of IR drop on chip performance is significant, primarily affecting timing. IR drop compromises the drive capability of the gates and increases the overall delay. Typically, a 5% drop in supply voltage can affect delay by 15% or more. Such an increase in delay is critical when clock skews are in the range of 100 picoseconds. IR drop can make path delay unpredictable and can even cause the critical path of a design to change. For this reason, nanometer timing calculations must take IR drop into account.

IR drop has significant impact on SI. Supply voltages wink as nanometer device dimensions are scaled to avoid tensistor punch-through conditions, hot-electron effects, and tevice breakdown. This has resulted in smaller and smaller margins. IR drop reduces these margins even further, matering nanometer designs more vulnerable to SI noise tests.

A complete picture of power grid integrity can only be sined when effects such as IR drop, ground bounce, and migration are computed and analyzed accurately. See effects involve complex interactions across a chip and be analyzed at the full-chip level. However, a leading-45 nm design can contain over 10 million gates, which could produce a grid approaching one billion RC elements.

Therefore, power grid verification tools must have the mostly and performance required to analyze detailed most to such large, complex chips with reasonable mostly time.

power grid verification tools rely on mplifications, such as black-boxing embedded IP blocks simple power-consumption distribution models, thus accuracy for speed. However, because the current through a chip's power grid doesn't stop at the ports and IP blocks, power grid verification tools must be to the transistor level while maintaining the speed and





Fig. 3 Gate-level analysis doesn't accurately model IP.



Fig. 4 Shrinking design impact IR drop analysis

efficiency required for large-scale nanometer designs, as shown in Figure 3. The ability to pre-characterize the power grids of library elements facilitates the reuse strategies that are central to nanometer design styles.

In the gate-level plot (left), the IR drop in the center of the chip is fairly severe. But the transistor-accurate analysis (right) of the IP's contents and the current flowing through the IP reveals that the IR drop is actually much smaller

We need to understand General robustness of power rail design (static), Power rail electro migration (static), IR drop impact on timing (dynamic),Optimal size & location of decoupling capacitance (dynamic),Trade-off de-coupling capacitance vs leakage power (dynamic)

This figure 4 shows how de-coupling capacitors are added to the power rails to minimize IR drop transients. What happens is that the de-coupling capacitor acts as a local charge source to help charge up the load capacitor through the p-channel transistor. This local charge source helps to minimize the amount of current that is drawn through the resistive power rails, which helps minimize IR drop transients.

To understand the general robustness of the power rails, a static approach can be used. This approach will clearly identify major weaknesses in the power rails, such as routing that is too narrow and missing vias or power straps. The value of the static approach is that it is available very early in the design cycle, even prior to signal routing being completed.

The static approach is also the best solution for power rail electro migration, since electro migration is caused by high average currents flowing over a period of time.

The static approach is good down to 130nm, but beyond 130nm, a dynamic approach should be used so that the impact

of IR drop transients on timing can be analyzed. The dynamic solution also enables to understand where to add or remove de-coupling capacitors and evaluate the impact on transient IR drop and total leakage.

#### V. CROSSTALK ANALYSIS

Crosstalk analysis considers the effect of wire coupling, increase clock frequencies and slew rates, lower supply voltages, smaller transistor thresholds on the performance and functionality of a design. The primary source of wire coupling is wire-to-wire capacitances, though, for very high-speed nanometer designs, inductive coupling can also be a problem.



Fig. 5 Crosstalk functional failure

Crosstalk can also cause functional failures. In this example, coupling from attacker A causes a significant glitch on the reset signal such that it resets the flip-flop and destroys the stored logic state. This problem will not show up in logic simulation, formal verification or timing analysis. The designer is mystified. Isolating the source of a noise induced functional problem can take many months of debugging resulting in lost time to market.

Due to glitch, there is change (e.g., from logic 0 to logic 1) to appear on otherwise static signals. A glitch has a true functional impact when it propagates and is latched before the next clock cycle, as shown above. Consequently, to verify if a design has a functional problem due to crosstalk, it is not sufficient just to determine if a signal is subject to a sizeable noise pulse. The circuit must be tested to determine if that noise pulse will propagate and reach a storage element such as a latch or flip-flop. Therefore, effective sign-off crosstalk analysis must include both accurate noise glitch calculation and noise propagation.

Accurate noise glitch calculation must account for temporal and logical relationships of the aggressor signals, as well as the holding resistance of the victim driver. When a driver is holding a static logic state, its resistance is larger than when it is switching. Therefore, noise calculations using switching resistance derived from standard timing libraries over-estimate noise. A separate cell library characterization step is required to calculate holding resistance.

For accurate noise propagation, simulation of the receiver input noise is required to determine the amount of noise to propagate. Since digital CMOS logic gates are inherently noise-immune and act as low-pass filters, most glitches will be filtered before they are latched. Without accurate noise

propagation, the number of functional failures will be greatly exaggerated, while at the same time, the true crosstalk-induced functional errors easily could be overlooked. Figure 5 shows a comparison of noise failures reported by using a glitch peak threshold of 30% of supply versus using noise propagation. For these examples, noise propagation reports over 400 times fewer failures on average. For the largest example, this reduces the task of tracking down more than 10,000 reported crosstalk violations—a task that literally could cost manweeks—to a far more manageable 40 violations.



Fig. 6 Number of crosstalk violations using noise propagation vs. glitch peak.

Other types of crosstalk-induced functional violations are bootstrap noise and double clocking. Bootstrap noise failures occur when crosstalk on the input of an unbuffered latch pulls a static signal below ground (undershoot) or above supply (overshoot). In such cases, that the logic value stored in the latch can be destroyed, even when the clock is inactive. Overshoot and undershoot can also cause reliability problems by over-stressing transistors and causing time-dependent dielectric breakdown (TDDB).

Double clocking occurs when a glitch causes a clock to cross its switching threshold more than once during a single transition. In this case, even a small crosstalk glitch can cause a functional violation due to false triggering of flipflops or latches. Double clocking can be particularly troublesome for counters, state machines, and on-chip clock dividers, all of which rely on the fidelity of the clock signal.

#### Crosstalk Repair Techniques for Glitch and Delay

- Victim driver sizing, buffer insertion, aggressor downsizing
  - Rip-up aggressors and victim spacing
  - Soft spacing, victim rerouting
  - Shielding

## VI. DESIGN SIGN-OFF FLOW FOR 45NM AND BELOW

An electrical chip problem can be rooted due to crosstalk and IR drop. Therefore, it is necessary to integrate the noise and IR drop analysis into a single flow as shown in *Figure 9*. Following 3-D parasitic extraction, IR drop analysis is performed to determine the worst-case IR drop for each cell instance. This information is then used as input to SI aware STA and for crosstalk-glitch analysis



Fig. 7 Crosstalk Repair Techniques



Fig. 8 Design sign off flow

Since SI-aware STA is dependent on IR drop, it is performed after IR drop analysis. Unlike traditional staticiming sign-off, SI-aware timing sign-off accurately calculates delays in the presence of noise (both crosstalk and IR drop), resolves the interdependency between crosstalk and timing (arrival times and slews), and avoids overly pessimistic approximations. SI-aware timing analysis must also filter realistic combinations of SI events, either automatically or user control. A timing sign-off flow that blindly uses worst-case data and accumulates all SI delay effects likely will zeader an unattainable timing target. Finally, the SI-aware off timing analysis must be supported with the library necessary to permit accurate calculation of SI effects. data must be readily available from library vendors, and or easily created via automated library characterization Emis.

As crosstalk glitch analysis is dependent on both converged timing windows and slews from SI-aware STA and IR drop analysis, this check is performed last.

#### VII. CONCLUSIONS

Physical IC verification for shrinking design 45 nm and below requires advanced tools and methodologies in order to cover all its effects. Its success also requires a new set of implementation, analysis, and database technologies.

Silicon integrity and IR drop have become most important factor in measuring of timing effects due to shrinking 45 nm and below design, and in such case, EM is also an issue for signals as well as the power grid. An extensible, unified database provides the foundation for nanometer design, especially since most designs are be digital/mixed-signal. It needs to support a rich set of objects, attributes, and relationships. Perhaps more importantly, it must support extensibility with native performance. The database and all nanometer tools should support hierarchy elegantly and handle 100M or more gate designs efficiently.

No Doubt, shrinking design implementation places extraordinary demands on design teams to calculate power rail electro migration (static), IR drop impact on timing (dynamic), optimal size & location of de-coupling capacitance (dynamic) and to do crosstalk analysis to measure it effects on timing and chip functionality.

#### REFERENCES

- J.T. Kao and A.P. Chandrakasan, "Dual-threshold voltage techniques for low power digital circuits," *IEEE J. Solid-State Circ.*, pp. 1009-1018, Jul. 2000.
- [2] T. Kuroda, et al., "A 0.9V, 150MHz, 10mW, 4mm2, 2-DCT core processor with variable VT scheme," *IEEE J. Solid-State Circ.*, pp. 1770-1778, Nov. 1996.
- [3] H. Kawaguchi, et al., "A CMOS scheme for 0.5V supply voltage with pico-ampere standby current," Proc. ISSCC, pp. 192-193, 1998.
- [4] M.C. Johnson, et al., "Leakage control with efficient use of transistor stacks in single threshold CMOS," Proc. DAC, pp. 442-445, 1999.
- [5] L. Wei, et al., "Design and optimization of dual-threshold circuits for low-voltage low-power applications," *IEEE T. VLSI Sys*, pp. 16-24, Mar. 1999.
- [6] S. Tyagi, et al., "A 130nm generation logic technology featuring 70nm transistors, dual-Vt transistors and 6 layers of Cu interconnects," Proc. IEDM, pp. 567 - 570, 2000.